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ABSTRACT 

We use the Who versus [N ii]/Hq; (WHAN) diagram introduced by us in previous work 
to provide a comprehensive emission-line classification of Sloan Digital Sky Survey galaxies. 
This classification is able to cope with the large population of weak line galaxies that do not 
appear in traditional diagrams due to a lack of some of the diagnostic lines. A further advan- 
tage of the WHAN diagram is to allow the differentiation between two very distinct classes 
that overlap in the LINER region of traditional diagnostic diagrams. These are galaxies host- 
ing a weakly active galactic nucleus (wAGN) and "retired galaxies" (RGs), i.e. galaxies that 
have stopped forming stars and are ionized by their hot evolved low-mass stars (HOLMES). 

A useful criterion to distinguish true from fake AGN (i.e. the RGs) is the value of 
which measures the ratio of the extinction-corrected Ha luminosity with respect to the Ha 
luminosity expected from photoionization by stellar populations older than 10 8 yr. We find 
that £ follows a markedly bimodal distribution, with a £ » 1 population composed by systems 
undergoing star-formation and/or nuclear activity, and a peak at £ ~ 1 corresponding to the 
prediction of the RG model. We base our classification scheme not on £ but on a more readily 
available and model-independent quantity which provides an excellent observational proxy for 
the equivalent width of Ha. Based on the bimodal distribution of Wh«, we set the practical 
division between wAGN and RGs at Who = 3 A. 

Five classes of galaxies are identified within the WHAN diagram: 

• Pure star forming galaxies: log[Nn]/Ha < -0.4 and Wn a > 3 A 

• Strong AGN (i.e., Seyferts): log[Nn]/Ha > -0.4 and W Ha > 6 A 

• Weak AGN: log[N u]/Ha > -0.4 and W^a between 3 and 6 A 

• Retired galaxies (i.e., fake AGN): Wn a < 3 A 

• Passive galaxies (actually, line-less galaxies): Who and W[Nirj < 0.5 A 

A comparative analysis of star formation histories and of other physical and observational 
properties in these different classes of galaxies corroborates our proposed differentiation be- 
tween RGs and weak AGN in the LINER-like family. This analysis also shows similarities 
between strong and weak AGN on the one hand, and retired and passive galaxies on the other. 

Key words: galaxies: active - galaxies: stellar content - galaxies: evolution - galaxies: statis- 
tics 



1 INTRODUCTION 

The goal of a classification scheme is to rationally organize a large 
collection of objects (animals, plants, books, galaxies) into a fixed 
number of smaller classes without leaving any object aside , in or- 
der to help dealing with this large collection. There are obviously 
many ways to classify objects, depending on what properties one 
is mostly interested in. For galaxies, one could, for example, be 
mainly interested in the morphology (Hubble 1936), colours (de 
Vaucouleurs 1960), spectral features (Morgan & Mayall 1957) etc. 



It is not only the considered object properties that change from one 
classification system to another, but also the philosophies underly- 
ing the classification process (hierarchical or non hierarchical, su- 
pervised or not supervised, . . . ) all of which have their advantages 
and drawbacks. Ideally, a classification scheme should be objective 
and allow easy incorporation of new objects into predefined cate- 
gories. The last criterion, for instance, is not fulfilled by the classifi- 
cations using principal component analysis (Sodre & Cuevas 1994; 
Connolly et al. 1995). Most importantly, the classification scheme 
must be useful, Hubble's famous tuning fork being a classical ex- 
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ample. Another one is the emission-line classification scheme pio- 
neered by Baldwin, Phillips, & Terlevich (1981) (see also Pastoriza 
1968), which has been widely used over the last three decades. 

The advent of large surveys of galaxies providing extensive 
collections of homogeneous data sets not easily tractable with con- 
ventional methods fostered new classification schemes based on 
automatic procedures such as neural networks (Folkes, Lahav, & 
Maddox 1996), k-means cluster analysis of entire galaxy spectra 
(Sanchez Almeida et al. 2010), or on human-eye analysis of galaxy 
morphology by hundreds of thousands of volunteers, as in the beau- 
tiful Galaxy Zoo project (Lintott et al. 2010). 

Emission-line classifications of galaxies are among the easiest 
to carry out (once the emission line intensities are measured) and 
allow one to deal with such issues as star formation, chemical com- 
position or nuclear activity. They have, for example, helped to gain 
insight into the nature of warm infrared galaxies (de Grijp et al. 
1992), of luminous infrared galaxies (Smith, Lonsdale, & Lons- 
dale 1998), of galaxies in compact groups (Coziol et al. 1998) or 
into the connection between active galactic nuclei (AGN) and their 
host galaxies (Kauffmann et al. 2003). The most famous emission- 
line diagnostic diagram is that based on the [O m]/15007/H/3 vs 
[N ii]/16584/Hq' line ratios 1 (dubbed the BPT diagram after Baldwin 
et al. 1981). This diagram was built to pin down the main source of 
ionization in the spectra of extragalactic objects, and became one of 
the major tools for the classification and analysis of emission line 
galaxies in the Sloan Digital Sky Survey (SDSS, York et al. 2000). 

As emphasized by Cid Fernandes et al. (2010, CF10), the 
BPT diagram leaves unclassified a large proportion of emission line 
galaxies in the SDSS, due to quality requirements on 4 emission 
lines. The Kewley et al. (2006) classification is even more demand- 
ing, as it requires the intensities of as many as 7 lines. To remedy 
this situation, CF10 have proposed a much more economic diagram 
that allows one to attain the same objectives using only two lines, 
Hey and [Nn], which are generally the most prominent in the spec- 
tra of galaxies. This is a diagram plotting the Ha equivalent width 
versus [N nl/Ho-, which was dubbed the EWHan2 diagram in that 
paper and which we will from now on refer to as the WHAN dia- 
gram, for simplicity 2 . The border lines between different classes of 
galaxies in the WHAN diagram were defined by optimal transpo- 
sitions of the Kauffmann et al. (2003), Stasinska et al. (2006, S06) 
and Kewley et al. (2006) border lines onto the Wu a vs [Nii]/Hq- 
plane. 

The categories of emission line galaxies considered by Kauff- 
mann et al. (2003) and Kewley et al. (2006) comprised star forming 
galaxies (SF), Seyferts, and LINERs 3 This latter category is sup- 
posed to comprise objects hosting low-level nuclear activity. How- 
ever, as demonstrated by Stasinska et al. (2008, S08), the LINER 
region in the BPT diagram also contains galaxies that have stopped 
forming stars and are actually ionized by the hot evolved low-mass 



In the remaining of this paper, [Om]/t5007 and [Nn]/16584 will be de- 
noted [Om] and [Nil], respectively. 

2 Our WHAN diagram is, in a way, very similar to the W[m ii] vs [N ii]/Hq- 
diagram of Coziol et al. (1998). However, the WHAN diagram is easier 
to interpret, since it concerns two physically independent quantities, W\\ a 
measuring the amount of ionizing photons absorbed by the gas relative to 
the stellar mass, while [N ii]/H<? is a function of the nitrogen abundance, of 
the ionization state and temperature of the gas. 

3 In those papers, LINERs should have rather been called "galaxies with 
LINER-like spectra", given that LINER is the abbreviation of "low ioniza- 
tion nuclear emission regions" (as originally introduced by Heckman 1 980), 
and the SDSS spectra cover much more than the nuclear regions of galaxies. 



stars (HOLMES) contained in them. This idea is not new. It was 
already put forward to explain the emission line spectrum of el- 
liptical galaxies (Binette et al. 1994; Macchetto et al. 1996) and 
even earlier in different contexts (Hills 1972; Terzian 1974; Lyon 
1975; Sokolowski & Bland-Hawthorn 1991). This idea is recently 
gaining support (Schawinski et al. 2010; Masters et al. 2010; Kavi- 
raj 2010), especially with the detailed study of nearby galaxies 
(Sarzi et al. 2010; Annibali et al. 2010; Eracleous, Hwang, & Flohic 
2010). As shown by CF10, when taking into account the "forgotten 
population" of weak line galaxies, the proportion of such "retired 
galaxies" (RGs) increases dramatically. It is therefore essential to 
find ways to distinguish these RGs from truly "active" galaxies, i.e. 
galaxies which contain a (weak) AGN. This is one of the goals of 
the present paper. 

Our second goal is to link the universe of emission line galax- 
ies (ELGs) with the universe of galaxies without emission lines (of- 
ten dubbed "Passive Galaxies", PG). We will see that RGs and pas- 
sive galaxies are in fact very similar objects. 

With our WHAN diagram, we will thus be able to provide a 
comprehensive classification of all SDSS galaxies, including PGs. 

The paper is structured as follows: Section 2 presents the 
samples of galaxies used to work out our classification and our 
starlight data base from which the properties of galaxies that we 
use are extracted. Section 3 lists the initial definition of galaxy 
classes: SF, Seyferts, LINERs and PGs. Section 4, the core of this 
paper, sorts out RGs from AGN among LINER-like galaxies and 
proposes a practical criterion to identify them. Section 5 describes 
our comprehensive classification of galaxies based on the WHAN 
diagram and leading to five classes: SF, strong AGN, weak AGN, 
RG and PG. Section 6 examines the star formation histories of 
galaxies within our new classification scheme, placing RGs in the 
context of the galaxy population in general. Section 7 discusses the 
distributions of a series of physical and observed properties in the 
different galaxy classes, to test the pertinence of our new classifi- 
cation. Finally, Section 8 summarizes our main results. 



2 SAMPLES AND DATA PRODUCTS 

We draw from the nearly 700 thousand galaxies from the Main 
Galaxy Sample (Strauss et al. 2002) in the 7 th data release of 
the SDSS (Abazajian et al. 2009). Besides the SDSS data, we 
make use of a suite of STARLiGHT-derived products available at 
www.starlight.ufsc.br, like stellar masses, stellar extinctions and 
the full star formation histories (Cid Fernandes et al. 2005). Emis- 
sion line properties are measured after subtraction of the underlying 
stellar component determined with the starlight fit, as described in 
Mateus et al. (2006). 

Two samples are used throughout this work. The "full sample" 
(hereafter "sample F") is selected by requiring (i) a signal-to-noise 
ratio > 10 in the continuum around 4750 A, (ii) no faulty pixels 
within ±15 A of Hey and [Nn], (Hi) z < 0.17 to eliminate the few 
objects at higher redshift which survive the previous criterion, and 
(iv) z > 0.04 to minimize apertures effects. In total, sample F con- 
tains 379330 galaxies. 

Sample V is built from a volume limited sample satisfying 
0.04 < z < 0.095 and absolute r-band magnitude M r < -20.43, to 
which we further apply criteria (i) and (ii) above. These latter cuts 
imply only a 6 percent reduction, with little loss of completeness 
at the faint end, so this sample is effectively limited in volume. 
Sample V comprises 152749 objects. By choosing z < 0.095 we 
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eliminate cases where Ha and [N n] fall in a region of strong sky 
lines, so that criterion (ii) is fulfilled. 

These samples fulfill the basic requirements for this work. 
Condition ( i) ensures meaningful spectral fits, necessary for stellar 
population analysis and the production of templates to aid emission 
line measurements, while (ii) ensures that the detection of Ha and 
[N n] (the most important lines in this study) is not hampered by ar- 
tifacts like bad pixels or sky residuals. At the same time, neither of 
these samples is biased for or against the presence of emission lines, 
which allows us to treat systems with strong, weak or no emission 
lines on an equal footing. 

Sample F has the advantage of being more numerous and of 
extending to lower luminosities, while sample V is more adequate 
for demographic studies and unbiased comparisons among sub- 
samples. The novelties in this work, namely the criterion to sepa- 
rate true from fake AGN and the resulting revision of conventional 
spectral taxonomy, are mainly related to the most luminous galax- 
ies, so that, in practise, we could use either sample V or F. In what 
follows, unless stated otherwise, we use sample V. 



3 PRELIMINARY GALAXY CLASSES 

We sub-divide galaxies into different classes according to their 
emission line properties. The first division is among galaxies with 
and "without" emission lines, denoted ELGs and PGs, respectively. 

3.1 Passive galaxies 

PGs are defined as those with very weak or undetected emission 
lines. Galaxies are deemed passive if the equivalent widths (W A ) of 
both Ha and [N n] fall below 0.5 A, a criterion which applies to 19 
(20)% of sample V (F). Since these are the two strongest optical 
lines, this criterion automatically implies that other lines will be 
even weaker in general (e.g. Brinchmann et al. 2004; CF10). 

In the literature one often finds PGs and ELGs defined in terms 
of limits on the signal to noise ratio (SN A ) of emission lines (e.g. 
Miller et al. 2003; Brinchmann et al. 2004; Mateus et al. 2006). We 
consider a W A -based criterion more appropriate, as it is based on 
a more direct measurement of the line strength and less explicitly 
dependent on the quality of the data. In addition W/s have an as- 
trophysical meaning. We emphasize that the adopted limiting value 
of 0.5 A is not really meaningful. As will be seen later, none of 
the discussion presented in this paper would have changed if had 
we adopted a slightly different limit. We also note in passing that 
the denomination "Passive" for "line-less" systems is adopted just 
for compatibility with current nomenclature, with no evolutionary 
connotation as the word would suggest. 

3.2 Emission Line Galaxies: SF, Seyferts and LINERs 

ELGs are defined as everything else, i.e., any source where either 
Wn a or W[nii] is > 0.5 A. Notice that this stretches the definition of 
an ELG to its limit, since, in extreme cases, only one of Ha or [N n] 
is detected! Sample V (F) has 124410 (303194) ELGs, 94 (92)% of 
which have both lines stronger than 0.5 A. 

ELGs are normally subdivided into SF and AGN-like classes 
on the basis of diagnostic diagrams involving at least 4 lines, like 
the BPT diagram ([OirfJ/H/? versus [Nn]/Ha). As shown in CF10, 
the weakness of either or both of H/3 and [O m] implies that such 
traditional classification schemes leave large numbers of ELGs un- 
classified. This problem is particularly severe in the right wing of 
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Figure 1. The WHAN diagram, used to establish emission line classes. The 
line labelled S06 7 represents the optimal transposition of the S06 SF/AGN 
spectral classification scheme, while the line at Wh<i = 6 A represents the 
transposed version of the Kewley et al. (2006) Seyfert/LINER class divi- 
sion. Dotted lines mark W\\ a = 0.5 and W[nii] = 0.5 A limits, below which 
line measurements are uncertain. Passive galaxies (W\\ a and W[nh] < 0.5 
A, red dots) lie below the dashed line. Dots correspond to sample F and 
contours to sample V. Contours are drawn at number densities = 0.1, 0.2, 
0.4 and 0.6 of the peak value. As in other figures in this paper, only 1 tenth 
of the objects is plotted to avoid overcrowding. 



the BPT diagram, where AGN-like systems reside, but well over 
half of the galaxies have unusable H/3 and/or [O in] fluxes. In that 
same paper we introduced alternative diagnostic diagrams able to 
cope with galaxies with weak or undetected H/3 and/or [Ora], the 
most economic of which is Wu a versus [Nul/Hcr, hereafter the 
WHAN diagram. We base all our classifications on this diagram, 
which, unlike any other, allows us to classify all galaxies. 

The WHAN diagram is shown in Fig. 1. The vertical line 
at log[Nn]/HQ- = -0.40 corresponds to the optimal transposition 
of the S06 BPT-based SF/AGN division, designed to differentiate 
sources where star formation provides all ionizing photons from 
those where a harder ionizing spectrum is required. Similarly, the 
division at Wyia = 6 A represents an optimal transposition of the 
Kewley et al. (2006) division between Seyferts and LINERs on the 
basis of 7 emission lines. Unlike Kewley et al. (2006) and other 
studies (e.g., Kauffmann et al. 2003; Mateus et al. 2006), we do not 
define a "SF+ AGN composite" category, and so our Seyfert/LINER 
border line goes all the way to the SF/AGN frontier. (The reader is 
referred to CF10 for a discussion of the pros and cons of this dia- 
gram, as well as for a discussion of why we prefer not to define a 
composite category.) Sample V splits into percentage fractions of 
23, 21 and 37 SF, Seyferts and LINERs, respectively. PGs account 
for the remaining 19%. The corresponding fractions for sample F 
are very similar: 25, 19, 36 and 20%. 

The dotted lines in Fig. 1 correspond to min(W H », W[nii]) = 
0.5 A. Points below these lines (painted in orange or red) are thus 
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very uncertain 4 , but are nevertheless plotted to remind the reader of 
the existence of such objects. There is indeed some advantage in in- 
cluding line-less galaxies in an emission line classification scheme 
since the absence of emission lines is also an information! The re- 
gion below the dotted lines includes both ELGs with only one line 
> 0.5 A, as well as PGs (below the dashed lines, in red) for which 
we have Ho- and [N n] fluxes in our database. 



4 IDENTIFYING RETIRED GALAXIES 

Ideally, splitting ELGs into SF and AGN would separate systems 
whose emission lines are powered exclusively by young stars from 
those where accretion onto a central super-massive black hole dom- 
inates (or at least contributes to) the ionizing radiation field. How- 
ever, as reviewed in the introduction, this binary scheme does not 
account for a third phenomenon: photoionization by HOLMES. 
These Retired Galaxies (RGs) are numerous in large-scale surveys 
(S08), where they occupy LINER-like locations in the commonly 
used diagnostic diagrams, and are thus erroneously associated to 
non-stellar activity. 

This section investigates ways of identifying RGs. We start 
with a theoretically motivated definition, but end up with a very 
simple empirical criterion to diagnose RGs among ELGs. 



4.1 Ionizing photons budget: The f ratio 

By definition, RGs are systems whose old stellar populations suf- 
fice to explain the observed emission line properties. Since we al- 
ready know that nebulae photoionized by HOLMES cover most 
regions in intensity ratios diagrams (S08), we shall adopt an opera- 
tionally simple energy-budget criterion and define RGs as galaxies 
whose old stellar populations are capable of accounting for all of 
the observed Ho- emission . 

Let LSI be the intrinsic (extinction corrected) Ha luminosity 
of a galaxy, and L^(t > 10 8 yr) denote the luminosity expected 
from photoionization by populations older than 10 8 yr. The ratio 
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should thus be able to tell if a galaxy is retired (£, < 1) or whether 
some extra source of ionizing photons is present. Given the orders 
of magnitude difference between the ionizing photon rates of young 
and old populations, even tiny amounts of ongoing star formation 
suffice to swamp the ionizing field produced by HOLMES, leading 
to £ » 1 and to a SF-like emission-line pattern due to the softer 
ionizing spectrum. Similarly, <f will also grow above 1 in the pres- 
ence of an AGN. In fact, as long as a galaxy has enough gas to 
absorb all hv > 13.6 eV photons produced within it, £ should never 
drop below 1 . 

This way of identifying RGs was previously employed by S08. 
In what follows we open up the details and discuss the caveats re- 
lated to the estimation of ^. 



4 Our line fitting code, described in Mateus et al. (2006), only accepts fits 
with W,{ ^ 0.1 A, which produces the empty triangle in the bottom left part 



4.2 Computation of the expected Ha luminosity 

Neglecting escape and extinction of Lyman continuum radiation, 
the denominator in eq. 1 becomes 

hv Ha 



Ig(t > 10V) ^ 



Jb 



x <27(r > 10 s yr) 



(2) 



where fn a = 2.226 for case B hydrogen recombination, and <2™ p is 
the expected ionizing photon rate. The starlight spectral decompo- 
sition in terms of a base of simple stellar populations (SSPs) allows 
us to estimate Q^ p for each of our galaxies. The code returns the 
mass associated to each of N+ SSPs of different ages (t) and metal- 
licities (Z), so that the expected flux of ionizing photons from the 
populations of interest here is simply 



Gh P (? > 10 8 yr) = M, £ 



(3) 
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where M* is the total stellar mass formed, jl is the mass-fraction 
population vector, and q H j = q^{tj,Zj) is the number of H-ionizing 
photons emitted per unit time and initial mass for the / h SSP Since 
the expected Ha luminosity is to be compared to the one observed 
within the 3"aperture of the SDSS spectroscopic fiber, the stellar 
mass in this equation corresponds to that inside the fiber (i.e., we 
do not extrapolate to account for the whole galaxy). In fact, all our 
analysis up to Section 6 uses within-the-fiber data only. 

The starlight fits used in this work employ the same base of 
SSPs from Bruzual & Chariot (2003, hereafter BC03) described by 
Mateus et al. (2006). Naturally, the spectral fits are carried out in 
the optical range, but the BC03 models also provide predictions for 
the ionizing energies. This is a key element of our analysis, so it 
is worth to open a parenthesis to examine these predictions more 
closely. 



4.3 SSP models beyond 1 Rydberg 

Fig. 2 shows the evolution of SSP properties of interest to this work. 
Panels on the left (a-d) correspond to BC03 models of 6 different 
metallicities used in our analysis. These are discussed first. 

Fig. 2a shows the evolution of the hardness of the ionizing 
spectrum, as measured by the ratio of Hel to HI ionizing photons. 
The difference between the soft ionization field produced by OB 
stars (/ < 10 7 yr) and that produced by HOLMES is clearly visible. 
Panel b shows the specific ionizing photon rate, qu(t, Z), the most 
relevant quantity for our analysis. One sees the well known drop 
by ~ 5 orders of magnitude in qu from the HII region phase to the 
time when the HOLMES regime sets in at 10 8 yr and q H approxi- 
mately stabilizes at a level of the order of 10 41 s _1 M^ 1 . The specific 
continuum around Ha, cn„ (equivalent to the inverse of the mass to 
light ratio, Kjjo)> however, keeps decreasing with time (panel c) and 
therefore the predicted Wn a (bottom panel) goes through a mini- 
mum at 10 s yr and grows to 1.5-2.5 A at t > 10 10 yr. This is enough 
to explain a significant proportion of the ELGs in the LINER zone 
of Fig. 1. 

The right column of panels Fig. 2 shows the predictions ob- 
tained for other evolutionary synthesis models at solar metallicity 
and Chabrier (2003) initial mass function (IMF, the same one used 
for the BC03 models on the left panels). Red lines show results for 
a preliminary version of Chariot & Bruzual models. Blue lines are 
used for Popstar models (Molla, Garcia- Vargas, & Bressan 2009) 
with Lejeune, Cuisinier, & Buser (1997) library and NLTE mod- 
els by Rauch (2003) and "Padova 1994" tracks (version updated by 
Molla & Garcia- Vargas 2000). Green lines represent Pegase models 
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Figure 2. Evolution of the ratio between the He i and Hi ionizing photon rates (panel a), Hi-ionizing photon rate (b, in units of r'M s '), Ha continuum (c, in 
units of LqA^'Mq 1 ), and Ha equivalent width (d) for simple stellar populations in the BC03 models of metallicities between Z* = 0.005 and 2.5Z Q . Panels 
on the right are Z Q models from 5 different sources (including BC03 for comparison). See text for details. The right axis scale in panels c and g gives the mass 
to light ratio in the Ha continuum: K\\ a = cJV. (in units of MqLq'A). 



(Fioc & Rocca-Volmerange 1997) with their 200 A- 10 /jm stellar 
library (see their section 2.2.1 for more details). Two sets of evolu- 
tionary tracks are shown for Pegase: Light green is used for Bres- 
san et al. (1993) tracks up to TP-AGB, dark green shows the set for 
Schaller et al. (1992) and Charbonnel et al. (1996) tracks. Both sets 
use Schoenberner (1983) and Bloecker (1995) for the PAGB. For 
reference, the BC03 predictions for Z a are repeated (black lines). 

Despite a general qualitative agreement, there are significant 
quantitative differences. For ages above 10 8 yr, the hardness ra- 
tio varies from 10 to 20%, while qn differs by up to 1 dex from 
one model to another. Those differences could be due to differ- 
ences in stellar tracks, model atmospheres and even interpolation 
techniques. For Wn a , the greatest difference is above 10 9 yr for 
the BC03 models, which are ~ 0.5 A larger than the others. We 
have also compared different IMFs (e.g., Salpeter 1955 and Kroupa 
2001). For a given metallicity in a model, different IMFs result in 
variations from 0.2 up to 0.5 dex in qu and cn a , which is typically 
of the same order of variations for different models with the same 
IMF. 

For the sake of internal consistency with our starlight anal- 
ysis of SDSS galaxies, in what follows we adopt the predictions 
from BC03, but this quick compilation illustrates that there are still 
plenty of uncertainties in model predictions for the Lyman contin- 



uum of old stellar populations. Naturally, these uncertainties prop- 
agate to our estimate of f . Fortunately, there is an empirical alterna- 
tive to circumvent the uncertainties related to this choice (Section 
4.8). 

4.4 Extinction correction 

Let us now turn to the numerator of eq. (1). The caveat in this case 
is that a nebular extinction is needed to correct L?. bs to Li 1 ? 1 , which 

Ha Ha ' 

in turn requires good measurements of both Ha and H/3. While this 
is not a problem for ELGs in the SF and Seyfert categories, most 
of the LINER-like systems do not comply with this requirement 
because of their feeble H/?. 

One way to circumvent this difficulty is to estimate Hcy/H/J in 
terms of something else. To do this, we first select LINERs with re- 
liable Hd/H/?, and then calibrate the relation between nebular (AJf b ) 
and stellar (Ay) extinctions (as done by Asari et al. 2007 for SF 
galaxies), such that A v can be used to estimate AJf b . In practice, 
we impose Wn a and Wnp > 0.5 A, and Ha/HyS > 3 (such that 
A n y* > 0) 5 , which leaves us with only ~ 30% of the full list of LIN- 

5 For the Cardelli et al. (1989) law used in our analysis, A!Jf b = 
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ERs. Nebular and stellar extinctions are indeed correlated for this 
sub-sample (Spearman rank correlation coefficient R s = 0.38), al- 
beit with substantial scatter. A fit through the median relation yields 
A n y h = 1.56A* + 0.53, with typical residuals of ~ 0.5 mag. Objects 
used in this calibration have Ay b = 1.0 ± 0.5 (median ± semi inter- 
quartile range), while for LINERs as a whole the above relation 
leads to A n y b = 0.8 ± 0.2. This method most likely overestimates 
AJf b , as there is a trend of increasing A'Jf b with increasing Wn a , and 
Ho- is much weaker in the full sample than in the sub-set used to 
calibrate the A^ cb (A*) relation (Wh„ = 1.5 versus 3.7 A in the me- 
dian, respectively). 

Another alternative is to apply the extinction correction when 
the data allow (W H a and Wh/3 > 0.5 A, and Ha/H/3 > 3), and as- 
sume Ay = otherwise. 

These methods lead to estimates of £ which should bracket the 
correct solution. Since A l y b = 1 implies a 0.32 dex increase in Lgi, 
for the typical inferred A'Jf b values these two methods should differ 
by ~ 0.2-0.3 dex in £ . 

4.5 Results: The bimodal distribution of £ 

The £ values derived range from ~ 0.1 to 1000. This is illustrated 
in Fig. 3a, which shows the distribution of £ values for all ELGs in 
sample V. The distribution is strongly bimodal , with a high £ peak 
at ~ 30 and a lower one centred close to £ = 1. 

The two histograms in Fig. 3a differ in the recipe to correct 
^Ha f° r extinction when Ha/H/? is unreliable. The dotted curve is 
for no correction, whereas the solid one is for the A^ eb (A*) method 
discussed above. The curves are identical for £ > 10 because these 
are Seyfert and SF galaxies, with Ha and H/3 fluxes good enough 
to apply the standard correction. In the low £ peak the two methods 
differ by just 0.2 dex on the median, as anticipated. Considering 
the smallness of this difference and that these recipes are likely to 
bracket the range of possibilities, we henceforth use the average £ , 
whose distribution peaks at £ ~ 1 . 

Overall, the low £ population is unbelievably close to the ex- 
act prediction for RGs: £ = 1 ! But are these galaxies really retired 
in the sense of not having formed stars for a long while? To an- 
swer this question, Fig. 3b shows the histograms obtained selecting 
only galaxies whose populations younger than 10 8 yr contributes 
no more than Xg = 10% of the light at 4020 A. 6 Clearly, the low £ 
peak is made up of galaxies which have formed little or no stars in 
the past 10 s yr, in full agreement with the RG scenario of S08. 

The width of the RG peak in the £ distribution (as measured 
from the one-sided dispersion of its lower half, and assuming that 
the distribution is symmetric) is o"i og ^ ~ 0.24 dex for the full ELG 
sample or 0.13 dex eliminating those with WW < 0.5 A. From the 
considerations in the above sections, one would hardly expect £ to 
be more precise than a factor of 2, so that this observed dispersion 
can be accounted for by uncertainties in the data and the analysis. 

While theory says that RGs should have f < 1, the observed 
bimodality in the £ distribution suggests that £ < 2 or 3 is a more 

7.215 log (Hg^i) 01 * ■ We adopt an intrinsic Balmer decrement of 3, suitable 
for LINERs (S08) n .' 

6 To have an idea of what this means, Xg < 10% corresponds to specific 
star formation rates <, 2.5 X 10~ 12 yr -1 , or, equivalently, to less than 0.025 
percent of the stellar mass having formed in the past 10 8 yr. In fact, for light 
levels as low as Xg = 10% the spectral synthesis analysis is affected both 
by its own limitations and by side-effects of incompleteness in evolutionary 
synthesis models, such that the true contribution from young stars cannot 
be reliably estimated (see Section 6). 
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Figure 3. Histograms of the ratio between the intrinsic Ha luminosity L\\ a 
to that predicted to originate from photoionization by HOLMES (equation 
1 ) for ELGs in our volume limited sample, and for two choices of extinction 
correction for objects with unreliable Ha/H/?: estimate Ay Cb from A* (solid 
lines), and no-correction (dotted). Panel (a) includes all sources, while in 
(b) we select galaxies whose populations younger than 10 s yr contribute 
less than Xg = 10% to the continuum at A = 4020 A. Panel (c) shows the f 
distribution which results from requiring convincing detections of H/3 and 
[Om], as well as Ho- and [Nil]. 

effective way of separating RGs from the rest of the ELG popu- 
lation (Fig. 3a). About 1/3 of the full ELG sample has £ < 2. The 
reason why this huge population of RGs has received little attention 
to date is because standard selection criteria tend to exclude them 
from emission line studies. Fig. 3c illustrates how the distribution 
of £ changes by imposing a W A > 0.5 A quality control for the 4 
lines in the BPT diagram: H/?, [Om], Ho- and [Nil], This reduces 
the ELG sample as a whole by ~ 1/3, but the fraction of £ < 2 
sources decreases by a factor of 3. The bimodality is still present, 
but the low £ peak is heavily suppressed, so that RGs are the main 
victims of this cut. With the WHAN diagram we are able to res- 
cue RGs and weak line galaxies in general, eliminating this bias 
and thus drawing a much more complete view of ELGs in the local 
Universe. 

4.6 Results: £ versus [N ii\/Ha 

Fig. 4 shows the results of this analysis in a £ versus [N iil/Ha di- 
agram, color coding SF, Seyferts and LINERs according to their 
location on the WHAN diagram (Fig. 1). The plot shows that SF, 
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Figure 4. ELGs in the £ versus [N n]/Ha diagram, color coding PGs, LIN- 
ERS, Seyferts and SF, as in Fig. 1. Also as in Fig. 1, gray points are sources 
with either or both of Ha and [N n] weaker than 0.5 A. Solid contours are 
for the full sample V, while the dashed ones are for galaxies with less than 
X$ = 10% of their flux at A = 4020 A due to populations younger than 10 8 
yr (for clarity, only the two outer contours are plotted in this case). Dot- 
ted horizontal lines mark a range of x2 around the prediction of the retired 
galaxy model (£ = 1). 



Seyferts and LINERs are well separated in terms of £ and [N iil/Ha, 
with SF and Seyferts being responsible for the £ » 1 peak and 
LINERs for the £ ~ 1 RG-like population. The plot also illustrates 
how dangerous it is to define a galaxy as an AGN on the basis of 
[Nn]/Hff alone. Miller et al. (2003), for instance, define "2-lines 
AGN" as sources with log[Nii]/Hti' > -0.2 (similar to the "low 
S/N AGN" in Brinchmann et al. 2004), while our analysis suggests 
that most of this population does not require an energetically sig- 
nificant non-stellar source of ionizing photons. 

The dashed contours in Fig. 4 show the location of ELGs with 
X s < 10%. This clarifies that the bulk of the population of large 
£ sources among galaxies with a predominantly old stellar popu- 
lation (seen in the middle panel of Fig. 3) is composed of LIN- 
ERs approaching the Seyferts zone. Further considering that the 
X s cut eliminates sources with significant on-going star formation, 
this strongly indicates that these sources have a non-stellar source 
powering most of its line emission (i.e., they are AGN in old hosts). 

We note in passing a tiny population of galaxies with small £ 
(i.e. consistent with our RG model) but located to the left of the S06 
line in Fig. 4. These could correspond low-mass, low-metallicity 
galaxies in a quiescent phase between bursts of star formation, such 
as the galaxies discussed by Sanchez Almeida et al. (2008). 



4.7 £ versus WW 

An important aspect of Fig. 4 is its striking similarity to the 
WHAN diagram of Fig. 1. This happens because £ and WW are 
very closely related. The equivalent width of Ha; is defined as 
WW = L^/C^*, where the denominator is the observed contin- 
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Figure 5. Relation between £ and WW. The bottom panel (c) shows the 
average £ value discussed in Section 4.4, while in the middle one (b) no 
extinction correction correction was applied to Ln a . Points in cyan (or- 
ange) correspond to galaxies where Ha/H/3 is (is not) reliable. The black 
solid line panels b and c shows the median relation, while the dashed 
lines represent the 10 and 90% percentiles . In all projected histograms the 
black line represents all ELGs, while the filled region represents those with 
log[Nn]/Ha > -0.4 (i.e., to the right of the S06 r line in the WHAN dia- 
gram), and the dashed red line indicates PGs for which a (necessarily small 
and uncertain) Ha flux is available. The arrow in the WW distribution marks 
the proposed limit to separate RGs from other ELGs. 



uum luminosity around Ho- 7 . Hence, extinction corrections aside, 
WW and £ (equation 1) have the same numerator. 

Because q H (t,Z) varies little with t or Z after 10 8 yr (Fig. 2b 
and f), and since most of the mass is always in the old popula- 
tions (over 99% for our samples), Q^ p (t > 10 8 yr) in equation (3) 



7 Chq- is not to be confused with the continuum luminosity per unit mass, 
<-'Ha , introduced in Section 4.3. We use capital letters for extensive quantities 
(like CW and Qh), and lower case for specific ones (chk and qu)- 
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becomes ~ M+q^ d , where q^ d , the specific ionizing flux due to 
HOLMES, is of the order of 10 41 s~'M '. can be written in 
terms of the intrinsic continuum at Ho and the galaxy's luminosity 
weighted mass-to-light ratio: = CJJ 1 (ita,)/,, where K Ha = 
was previously discussed in Section 4.3. While ku„ spans nearly 3 
decades for SSPs between 10 6 and 10 10 yr (Fig. 2c), galaxies are not 
SSPs, and young (low k h „) populations rarely make a substantial 
contribution to the Ho- continuum, so that the luminosity weighted 
ku„ values are of the same order for most sources. For the V sample, 
for instance, log(K Hff )z. = 4.56 ± 0.14 (median ± semi-interquartile 
range), while sub-samples of SF, Seyferts, LINERs and PGs have 
4.35 ±0.1 2, 4.47 + 0.08, 4.65 ±0.08 and 4.70 ±0.06, respectively (in 
units of M G Lg 1 A). As a result, LS* is directly proportional to the 
Ha continuum, with little scatter due to variations in the star forma- 
tion histories, and thus the denominators of £ and WW essentially 
measure the same thing. 

With these approximations, one finds that 



WW ^ .4(A» b -A* ) 



f = —^10' 
u/old 
VV Ha 



(4) 



where the Ag* - A H(r term comes from the fact that £ accounts for 
possible differences in the nebular and stellar extinctions, whereas 
Wft a does not, and 



hv 



-^{KHahqu = 0-355A 

JBa 



10 4 



10 41 



(5) 



is the expected equivalent width for an old population (for (khi,}l 
in units of M G L" 1 A, and g° ld in s _I M Q '). 

Fig. 5 shows the observed relation between £ and WW- The 
bottom panel (c) shows the dust corrected £ value. Points in cyan 
and orange correspond to galaxies where Ha/H/3 is or is not reli- 
able, respectively. The correlation is very tight, as indicated by the 
10 and 90% percentile lines (in dashed). However, the differential 
extinction correction to £ creates a hump starting around W Htt ~ 2 
A, coinciding with the transition between Ho-/H/3-based corrections 
and those based on other methods (see Section 4.4). 

Fig. 5b shows the same relation assuming that Ho- and its 
continuum are equally affected by dust (A^ eb = A y ), so that the 
last term in equation (4) disappears, and so does the hump in Fig. 
5c. The relation is linear except at high WW, where it steepens 
because of the decrease in (kh«)l in galaxies with lots of star- 
formation. A fit through the median curve in its low Wu a end 
yields log£ obs = 1.07 log Wn a - 0.17. Forcing a linear fit leads to 
^Hiv = 0-67 A. For comparison, this value of is reached for 
ages of a few Gyr for the BC03 SSPs, or closer to 10 Gyr for other 
models (bottom panels in Fig. 2). 



4.8 Retired Galaxies: An empirical diagnostic 

This section started with the promise of finding a diagnostic to iden- 
tify RGs. The f ratio was introduced with this purpose, and we have 
seen that it is indeed capable of capturing in a single number the 
essence of RGs. However, there are caveats related to this index. 
Firstly, uncertainties related to the extinction correction affect £ at 
a level of ±0.2 dex. Secondly, and far more important, £ is explicitly 
dependent on evolutionary synthesis models, whose predictions for 
the ionizing fluxes from old stellar populations are still plagued by 
systematic uncertainties (Section 4.3, see also S08). Hence, while it 
is clear that RGs should populate the bottom of the £-scale, theory 
alone does not allow us to establish a firm upper limit below which 
a galaxy can confidently be considered as retired. A last caveat is 



that the evaluation of £, requires a stellar population analysis, which 
is not always available or possible. 

Fortunately, there is a simple empirical alternative to circum- 
vent these difficulties. 

As seen in Fig. 5a, and in agreement with previous studies 
(e.g, Bamford et al. 2008), the distribution of Wh» is strongly bi- 
modal, with a low peak centred at Wu a = 1 A, an upper one at 
16 A, and an intermediate minimum in the neighbourhood of 3 A. 
This happens both for the sample as a whole and for the sub-set of 
AGN-like galaxies, shown by the filled histogram. This bimodality 
strongly suggests that different mechanisms are at work: Photoion- 
ization by HOLMES on the low Wh,v side and by massive stars 
and/or AGN for large Wn„. One can therefore use the observed WW 
distribution to set an empirical bound for RGs. Of course, the same 
can be done for the £ distribution, but it is clearly advantageous 
to draw this boundary on the basis of an universally available and 
model-independent quantity: Wu a - 

We conducted various experiments to define an optimal 
boundary to separate the two peaks, all giving results between 3 
and 4 A. We settled for W H „ = 3 A. We thus propose to the follow- 
ing practical definition: RGs are ELGs with WW < 3 A. 

For Wu a < 3 A, the central black hole may well be active 
(i.e., accreting), but its ionizing photon output is comparable to or 
weaker than that produced by HOLMES. For Wj^ values between 1 
and 2 A, which are compatible with current models (bottom panels 
in Fig. 2), one finds that at our proposed Wh„ = 3 A borderline 
the AGN contributes between ~ 1/3 and 2/3 of the ionizing power. 
Naturally, this fraction decreases even more as W Ht , decreases, to 
the point that AGN contribution is negligible for the bulk of the 
low Wu a population. It is therefore not correct to use the emission 
lines of W Hff < 3 A systems to infer AGN properties. 



5 A COMPREHENSIVE CLASSIFICATION OF 
GALAXIES 

We are now able to separate fake AGN (= RGs) from true AGN. In 
the LINER zone of Fig. 1, true AGN are defined by 3 A< W H „ < 6 
A. To avoid confusion, we shall call these sources "weak AGN" 
(wAGN). In the interest of a consistent notation, we shall also re- 
name Seyferts as "strong AGN" (sAGN). Finally, we remove from 
the SF category those galaxies consistent with a RG classification, 
that is to say those having Wn a < 3 A. 

Our final classification scheme is shown in Fig. 6. Sample V 
sources split into percentage fractions of 22, 21, 8, 31 and 18 SF, 
sAGN, wAGN, RG and PG, respectively. RGs therefore exist in 
large numbers in the SDSS, in agreement with the basic prediction 
that such systems are bound to exits as a mere consequence of stel- 
lar evolution. AGN, on the other hand, are far less common than 
one would infer associating all LINERs to non-stellar activity. 

This revised classification scheme has three main virtues: 

(i) It identifies the three main ionizing agents in galaxies: young 
stars, AGN and HOLMES. 

(ii) It is based on the cheapest and therefore most inclusive di- 
agnostic diagram, the only one able to handle the huge population 
of weak line galaxies (CF10). In fact, it even allows us to visualize 
a region occupied by PGs, something which is not possible in con- 
ventional intensity ratio diagnostic diagrams (of course, galaxies 
with no emission lines at all do not appear in this diagram). 

(iii) The SF/AGN and wAGN/sAGN frontiers are based on opti- 
mal transpositions of widely used demarcation lines defined on the 
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Figure 6. The WHAN diagram with the revised categories: SF, sAGN, 
wAGN, RGs and PGs. Points with one of [Nil] or Ha weaker than 0.5 A 
are plotted in gray, except for PGs, which are plotted in red. 



basis of traditional diagnostic diagrams, such that the WHAN dia- 
gram preserves traditional classification schemes where they work 
well, while at the same time fixing their inability to deal with weak 
line galaxies and to distinguish true from fake AGN. 

Before exploring the realm of galaxies under the perspective 
of this new classification scheme, a few clarifications are in order. 

First, notice that our SF/AGN border line corresponds to the 
transposition of the S06 BPT-based criterion to isolate systems 
powered exclusively by young stars from those where a harder ion- 
izing source is required. As such, it would have been better to call 
it a "pure-SF" demarcation line, instead of a "SF/AGN" divisory 
line. This is all the more true now that we realize that the AGN-like 
population of ELGs in the BPT is full of retired galaxies whose 
black holes are not active enough to affect their emission lines. 

Secondly, notice that the Wn a < 3 A criterion slightly modifies 
our idealized definition of RGs as "systems whose old stellar pop- 
ulations suffice to explain the observed emission line properties" 
(Section 4.1) to one where HOLMES make at least a substantial 
contribution to the ionizing field. Such a fuzzy frontier is unavoid- 
able. Our criterion is designed to avoid RGs being misclassified as 
AGN. On the other hand, some galaxies hosting an AGN, partic- 
ularly those containing a weakly active nucleus, may drift to the 
Wua < 3 A zone if observed within an aperture larger than the 
SDSS 3" spectroscopic fiber. 

Numerical experiments show that this is indeed likely to hap- 
pen. Picking the Ho- luminosity of a randomly chosen AGN and 
adding it to the spectrum of a random bona fide RG with Wn a 
within 1 + 0.5 A, we find that about 1/3 of the resulting simulated 
spectra have W Ht , < 3 A, most of which in the 2-3 A range. We thus 
expect that a fraction of the systems here classified as RGs turn out 
to harbour weak AGN when looked up more closely. At first sight, 
this seems to be a drawback of using Wn„ for spectral classifica- 
tion, but it is not. After all, if an AGN is present and yet Wu a < 3 



A, then it is still correct to infer that old stars make a substantial 
contribution to the observed line emission. 

Finally, we note that the division of AGN into strong and weak 
categories on the basis of Wyi a is also to some extent aperture de- 
pendent, since sAGN may be diluted to wAGN as more stellar 
continuum is sampled. This, however, is a minor issue, since both 
classes are clearly AGN-dominated systems, so there is no risk of 
misidentifying the dominant ionizing agent. 



6 THE STAR FORMATION HISTORIES WITHIN OUR 
CLASSIFICATION SCHEME 

A popular approach to digest the flood of information from mega- 
surveys like the SDSS is to compare the properties of galaxies in 
different classes (SF vs. AGN, Seyfert vs. LINER, early vs. late 
type, field vs. cluster, etc.), in search of clues which may help 
understanding why galaxies are the way they are (see Blanton & 
Moustakas 2009 for examples of studies following this general 
strategy). Taxonomy, of course, plays a central role in such com- 
parative studies. In this section we present yet another compara- 
tive study, but this time following the new taxonomical framework 
proposed in the previous section and summarized in Fig. 6. The 
main novelty in this scheme is the introduction of the RG cate- 
gory, whose members were previously misclassified as AGN. A 
major goal of this section is to place RGs in the context of the 
galaxy population in general, which was never done before. On the 
one hand, we want to compare RGs with other ELGs, especially 
wAGN, with which RGs were so far confused. It is also interest- 
ing to compare wAGN to sAGN, since this division was merely 
introduced for the sake of "backwards compatibility" with the his- 
torical Seyfert/LINER sub-division of AGNs. On the other hand, 
it is equally important to compare RGs and PGs. According to the 
RG concept, galaxies without emission lines should not exist , since 
HOLMES provide a minimum floor ionizing field capable of pow- 
ering emission lines. Seen from this perspective, the existence of 
PGs raises questions like: Are they truly line-less systems or is it 
just an issue of detectability? Perhaps PGs lack the warm/cold ISM 
to reprocess ionizing photons? 

The starlight- SDSS database offers plenty of material for this 
comparative study. In what follows we use a suite of physical prop- 
erties as well as the detailed star formation histories (SFHs) to ad- 
dress the issues raised above. Only the volume limited sample is 
used from here on. 

An instructive way to start this comparative study is to investi- 
gate how SFHs vary as a function of the SF/sAGN/wAGN/RG/PG 
spectral types defined in Fig. 6. This is done in Fig. 7 where we 
chopped the WHAN diagram in 0.2x0.3 dex wide bins in [Nnl/Ha 
and W Ha . In each box we plot the time-dependent median of the 
specific star formation rate (SSFR) for galaxies in the box, colour 
coding the curves according to the Fig. 6 emission line taxonomy. 
The SSFR(t) functions were smoothed exactly as in Asari et al. 
(2007), which should be consulted for details on how this and other 
representations of STARLiGHT-based SFHs are derived. 

Because this is our classification diagram, most boxes con- 
tain only one curve, with exceptions at locations straddling the 
nominal frontier between RGs and wAGN (at Wn a = 3 A) and 
wAGN/sAGN (Wn„ = 6 A). Although there are strong systematic 
changes in SFHs across this plane, in these border-line cases one 
sees little difference between the SFHs of two adjacent classes. 

Several trends are visible in Fig. 7. For instance, focusing on 
the column centred at log[Nn]/HQ- = -0.1, one sees that both PGs 
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Figure 7. Star formation histories as a function of spectral class and the 
location in the WHAN diagram. Each box shows the median SSFR(t) for 
galaxies of different types, coded by different colours. The scale of these 
plots is shown in the inset. Curves are only drawn for boxes containing over 
200 galaxies. Only galaxies in the volume limited sample are used in this 
plot. Contours correspond to 0.1 and 0.6 of the peak density. 

and RGs have SFHs concentrated in the largest ages, while, moving 
upwards the retrieved SFHs reveal increasing levels of recent star- 
formation. Due to the large apertures covered by the SDSS fibers, 
most of the Ho- emission in these AGN probably comes from star- 
formation rather than the AGN (e.g. Kauffmann et al. 2003), so the 
increase in recent SSFR with W^ a is expected. However, the effect 
of the AGN is present, as can be inferred by the decreasing levels 
of recent SSFR towards the right for fixed Wn a (best visible in the 
rows centred at W Hff ~ 10 A). 

Before we proceed, a parenthesis must be opened to clarify the 
meaning of the small upturn in SSFR(t) at t < 10 7 yr for RGs and 
PGs. This feature is almost certainly an artifact of incomplete mod- 
elling of the horizontal branch phase (Koleva et al. 2008; Ocvirk 
2010; Cid Fernandes & Gonzalez Delgado 2010). This effect was 
quantified by Ocvirk (2010), who estimates that these "fake young 
bursts" account for up to 12% of the recovered optical light frac- 
tions in spectral synthesis of genuinely old stellar populations. In 
units of the SSFR, this corresponds to a few times 10~ 12 yr -1 , which 
is precisely the level at which upturns at small ages are seen in Fig. 
7. We further find a clear trend for this artefact to be related to the 
stellar metallicity, with the low t hump increasing for decreasing 
Z*, in agreement with the fact that bluer horizontal branches occur 
for lower (Harris 1996; see also Fig. 8 for indirect evidence of 
this trend). Finally, if the upturn were really due to young stars, W aa 
should be much larger than observed. Including these fake young 
bursts in the predictions for the ionizing flux leads to (in the me- 
dian) expected Wn a values over 3 times as large as observed for 
RGs. The overprediction for PGs is even worse, with a median ex- 
pected W Ka of 3.2 A, while their definition requires Wn a < 0.5 A. 

Credible levels of recent star-formation only appear for boxes 
centred at Wn a > 3 A in Fig. 7, coinciding with the RG/wAGN 
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Figure 8. As Fig. 7, but in the plane defined by the mean stellar surface 
density and stellar mass, /i* is computed dividing half the stellar mass by 
the radius which contains half the z-band light. 

transition. This suggests that all AGN are associated to at least 
some level of ongoing star-formation. While this is well known and 
documented for sAGN (the so called "starburst-AGN connection") 
the situation is much less clear for wAGN (e.g. Gonzalez Delgado 
et al. 2004 and references therein). It is difficult to compare this 
finding with those in previous studies because of varying defini- 
tions of wAGN, but we can confidently say that the signatures of 
star formation in wAGN identified in this work would be wiped out 
without removing the RGs from the AGN category. 

We now investigate how SFHs vary in terms of properties not 
explicitly involved in the definition of our spectral classes. Fig. 8 
shows SFHs in a stellar mass (M*) versus mass density (//*) dia- 
gram. This allows us to compare the SFHs of galaxies which are 
similar in terms of global properties, but differ in terms of emission 
line properties, since this time each box can contain galaxies of 
different classes. Again, several trends are evident to the eye. One 
of them is the signature of downsizing, here manifested by the de- 
crease in recent SSFR levels as one moves up the mass scale, seen, 
for instance, in the curves for SF galaxies in the 1 st two columns 
of boxes. A similar behaviour is also seen for RGs and PGs 8 , but 
shifted in time to t ~ 10 9 yr. 

For this study, the more relevant trend in Fig. 8 is that in every 
box the lines corresponding to our different classes appear in the 
same order, with recent star formation decreasing from SF galax- 
ies to sAGN, wAGN, RGs and PGs. These last two, in fact, have 
median SFHs which are very similar and sometimes indistinguish- 
able. Both types retired from forming stars over 10 s yr ago. The 

8 Recall that the "mini bursts" at ; < 10 7 yr are not real, but artifacts of 
spectral fits with an incomplete base (Ocvirk 2010). The fact that these fake 
bursts correlate with galaxy mass, as seen in Fig. 8, is likely an indirect 
consequence of the connection between horizontal branch morphology and 
Z* coupled to the M^-Z* relation. 
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Figure 9. As Fig. 7, but for the observational color magnitude diagram. 

plot also shows that the recent SSFR of wAGN never reaches lev- 
els as low as those of RGs and PGs. Despite their generally low 
levels of star formation compared to sAGN of same M* and /j*, 
wAGN are not retired, reinforcing the idea that star formation and 
AGN activity are connected even in the weakest cases, at least in a 
statistical sense. 

Fig. 9 shows another example. The plot shows the observa- 
tional color magnitude diagram, with the familiar red sequence, 
blue cloud and green valley pattern. This diagram is meant to pro- 
vide an empirical representation of the age-mass relation. As ex- 
pected, systems in the blue cloud are more actively forming stars 
nowadays than in their past average, while red sequence galaxies 
(with the exception of AGN) tend to be retired. Again, the SF to 
PG sequence in levels of recent SSFR and the similarity between 
RGs and PGs seen in Figs. 7 and 8 is present in every box. 

In summary, the extended emission line classification scheme 
proposed here bears an excellent correspondence with the SFHs of 
galaxies. RGs and PGs have very similar SFHs for similar global 
properties. wAGN are more similar to sAGN in terms of SFHs 
than to RGs or PGs. Traditional classification schemes which ig- 
nore RGs, would, on the contrary, find wAGN to be more like PGs. 



7 DISTRIBUTIONS OF GLOBAL GALAXY PROPERTIES 

Comparing the distributions of global properties of different galaxy 
classes can provide a further test of the pertinence of our classifi- 
cation scheme. This is done in Fig. 10, which shows normalized 
histograms of PG, RG, wAGN, sAGN and SF galaxies in terms of 
a series of properties related to their structure, stellar populations, 
gas and dust content. 

A striking feature of these histograms is that for most proper- 
ties PG and RG have similar distributions, reinforcing our interpre- 
tation that PG and RG are essentially the same thing, or at least that 
these classes overlap heavily. On the other hand, the distributions 
for wAGN are more similar to the ones of sAGN, suggesting that 



these galaxies belong to the same family. This is a salutary a pos- 
teriori validation of our splitting of LINERs into two very different 
classes: RGs and wAGN. 

Except for SF galaxies, all other galaxy types have masses 
within roughly ±0.5 dex of 10" M G (Fig. 10a). This is mostly due 
to the luminosity cut for the V sample, which imposes a mass com- 
pleteness limit of ~ 10 ia8 M G . In the median, Mf GN < M* KGn ~ 
M^ G < M^ G . In terms of surface densities and velocity dispersions 
(panels b and c), PGs and RGs are statistically similar, occupying 
the high end of the distributions. Despite the substantial overlap, 
it is clear that wAGN and sAGN tend to have smaller cr* and 
than RGs and PGs. The same is true for the escape velocity, com- 
puted from (GM t /R) 1 12 with R given by the half-light z-band radius 
(panel d). PGs and RGs also differ from AGN in terms of structural 
properties like the concentration index (e) and the SDSS deVAB_r 
index (f). Inspection of SDSS images corroborates the similarities 
between PGs and RGs, most of which are ellipticals, although some 
of the RGs turn out to be spirals (as can be deduced by the tail 
towards low CI in Fig. lOe), including an apparently higher than 
normal rate of edge-on systems. An investigation of this issue is 
beyond the scope of this paper, but we note that this "contaminant" 
population tends to have Wn a close to the RG/wAGN frontier, and 
thus may be related to the unavoidable fuzziness of classification 
schemes. 

In terms of luminosity-weighted mean stellar ages (Fig. lOg), 
sAGN are younger than wAGN, which in turn are younger than 
RGs and PGs. From Fig. 7 we infer that the tail of RGs towards 
smaller mean ages is due to systems at the RG/wAGN border. The 
mean stellar metallicities (panel h) are similar for all types, with a 
tail of sAGN reaching lower values. 

The distributions of stellar extinctions (panel i) show a dis- 
tinct PG < RG < wAGN < sAGN pattern. To double check this re- 
sult, panel 1 shows the distribution of HafH/3, but only for sources 
where this ratio is reliable, a criterion which excludes PGs and most 
RGs. Despite its incompleteness, Fig. 101 confirms that wAGN 
have higher extinction than RGs and lower than sAGN. The class 
with the lowest extinction is the SF class, in spite of the intense 
star formation which is usually associated with high extinction. The 
underlying reason for a smaller extinction in SF than the rest of 
galaxies lies in their lower masses (see Stasinska et al. 2004; Garn 
& Best 2010). Regarding the PG -> RG wAGN -> sAGN ex- 
tinction sequence, its interpretation is not straightforward, since it 
involves the dust mass and distribution, as well as its composition 
(dust grains recently produced by the progenitors of HOLMES and 
expected to be found in RGs do not have the same extinction prop- 
erties as grains processed in the interstellar medium and associated 
with regions of star formation). 

Fig. lOj shows the distribution of £ uncorrected for dust. The 
PG — > RG -» wAGN — > sAGN — > SF sequence reflects the central 
role of Wu a in our classification scheme. This sequence is mirrored 
in the distribution of observed Ho- luminosities (Fig. 10k). PGs in- 
cluded in these panels have, by definition, very weak Ho- (median 
WW = 0.23 A, and median SNn a = 2.4 counting only those with 
an Ha flux in our database). Still, their locations in these diagrams 
are consistent with them not being completely line-less. 



8 DISCUSSION AND SUMMARY 

The goal of this paper was to provide a comprehensive classifica- 
tion of galaxies according to their emission line properties which 
would be able to cope with the large population of weak line 
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Figure 10. Normalized histograms of properties for different galaxy classes. Colours code the SF, sAGN, wAGN, RG and PG classes as in Fig. 6. Horizontal 
bars trace the 16 to 84 percentile ranges, with the median marked by a circle, with the vertical shifts ordered as SF (top), sAGN, wAGN, RG and PG (bottom). 
All galaxies from our volume limited sample are included, except for panel 1, for which a reliable Ha/H/3 was required, hence excluding all PGs and most RGs. 



galaxies that do not appear in traditional diagrams (e.g., BPT) be- 
cause they lack some of the diagnostic lines. This is possible us- 
ing the WHAN diagram (W H „ vs [NiiJ/Hq-) that we proposed in 
earlier work (CF10). An additional problem with the BPT dia- 
gram and other similar line-ratio diagrams is that, as shown by 
Stasihska et al. (2008), the LINER region contains a mix of two 
completely different families of galaxies: galaxies that contain a 
weak AGN and "retired galaxies", i.e. galaxies that have stopped 
forming stars and whose emission lines are powered by their hot 
evolved low-mass stars (HOLMES). A truly comprehensive classi- 
fication scheme must account for this phenomenon, and the WHAN 
diagram allows so. 

To distinguish true AGN from fake ones (i.e., the RGs), we 
have shown that a useful criterion is the value of £ , which measures 
the intrinsic extinction-corrected Hey luminosity in units of the He 
luminosity expected from stellar populations older than 10 s yr un- 
covered by our stellar population analysis software starlight. In 
principle, £ should be equal to one if the galaxy absorbs all the 



ionizing photons provided by its HOLMES, and > 1 if any other 
source of ionization is present. We have shown that this index is bi- 
modally distributed, with a £ » 1 population composed by galax- 
ies undergoing vigorous star-formation and/or nuclear activity, and 
a population of galaxies centred right at the predicted value for 
RGs. Computing £ for each galaxy, however, is not a trivial task: 
it requires a stellar population analysis able to identify the popu- 
lations of HOLMES and a recipe to obtain the ionizing radiation 
field from these HOLMES. The latter strongly depends on how the 
(yet quite uncertain) evolutionary tracks for post-asymptotic giant 
branch stars are incorporated into the stellar population codes as 
well as on other issues such as the IMF and the metallicities of the 
stellar populations. 

It is therefore preferable to base the distinction between true 
and fake AGN on the Ho- equivalent width, an easy to obtain obser- 
vational parameter which provides an excellent proxy for (;. In view 
of the strong dichotomy in the observed W Ha distribution for galax- 
ies in the right wing of the BPT diagram (i.e., the wing previously 
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supposed to contain only AGN), we propose a practical separation 
between weak AGN and RGs based on a class frontier at Wn„ = 3 
A. This physically inspired but data-guided separation corresponds 
to the minimum between the two modes in the W Hff distribution. 
We thus identify 5 classes of galaxies in the WHAN diagram: 

• SF: Pure star forming galaxies: log [Nul/Ha < -0.4 and 
W Ha > 3 A 

• sAGN: Strong AGN (i.e., Seyferts): log [Nn]/Ha > -0.4 and 
W Ha > 6 A 

• wAGN: Weak AGN: log [Nn]/Hor > -0.4 and 3 < Wn„ < 6 A 

• RG: Retired galaxies (i.e., fake AGN): Wn a < 3 A 

• PG: Passive galaxies (actually, line-less galaxies): WW and 
W[nh] < 0.5 A 

Note that the border between RGs and PGs is somewhat arbi- 
trary, but this is not really a concern. 

We have then analysed the star formation histories of the dif- 
ferent classes of galaxies, as well as the distribution of a dozen 
of physical or observational properties, such as the galaxy stellar 
mass M+, surface densities /i*, velocity dispersions cr+, and stellar 
extinctions A v obtained through starlight. All these comparisons 
corroborate our proposed differentiation between RGs and wAGN 
in the LINER-like family. We also find that wAGN have properties 
overlapping with those of sAGN, while RGs line up with PGs. In 
other words, wAGN are the low activity end of the AGN family, 
while RGs are passive galaxies with emission lines. As a matter 
of fact, both RGs and PGs are "retired" or "passive" galaxies, the 
real spectroscopic difference between them being the presence or 
absence of emission lines. It is only the inherited custom which 
imposed the adopted nomenclature. 

Now that these emission line classes have been established, 
further work could be to understand what is at the root of the 
strength of the AGN phenomenon or why some passive galaxies 
present emission lines and others do not. 
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